A Mutual Information Based Ensemble Method to Estimate Bayes Error
نویسنده
چکیده
Determining the performance bounds possible with a particular clas-siier or data set is often of great importance in pattern recognition applications. A previously introduced method for Bayes error estimation based on combining multiple classiiers outperforms more traditional estimates of this error in many instances. The accuracy of this estimate, however, relies on the correlation among the classiiers, a quantity that may be diicult to quantify precisely. Addressing this issue, we explore improvements to the ensemble based estimates of the Bayes error by considering information theo-retic issues. More precisely, we use mutual information to determine a \similarity" measure between trained classiiers. This approach provides a more reliable similarity measure than error correlation, and leads to more accurate bounds on classiication error. Application to an artiicial problem with known Bayes error rate and a real world problem involving underwater acoustic data demonstrates the accuracy of this method.
منابع مشابه
Bayes Error Rate Estimation Using Classi¢er Ensembles
Department of Electrical and Computer Engineering, University of Texas, Austin, Texas, USA The Bayes error rate gives a statistical lower bound on the error achievable for a given classification problem and the associated choice of features. By reliably estimating this rate, one can assess the usefulness of the feature set that is being used for classification. Moreover, by comparing the accura...
متن کاملA New Unequal Error Protection Technique Based on the Mutual Information of the MPEG-4 Video Frames over Wireless Networks
The performance of video transmission over wireless channels is limited by the channel noise. Thus many error resilience tools have been incorporated into the MPEG-4 video compression method. In addition to these tools, the unequal error protection (UEP) technique has been proposed to protect the different parts in an MPEG-4 video packet with different channel coding rates based on the rate...
متن کاملLabel-Driven Learning Framework: Towards More Accurate Bayesian Network Classifiers through Discrimination of High-Confidence Labels
Bayesian network classifiers (BNCs) have demonstrated competitive classification accuracy in a variety of real-world applications. However, it is error-prone for BNCs to discriminate among high-confidence labels. To address this issue, we propose the label-driven learning framework, which incorporates instance-based learning and ensemble learning. For each testing instance, high-confidence labe...
متن کاملA Novel Subsampling Method for 3D Multimodality Medical Image Registration Based on Mutual Information
Mutual information (MI) is a widely used similarity metric for multimodality image registration. However, it involves an extremely high computational time especially when it is applied to volume images. Moreover, its robustness is affected by existence of local maxima. The multi-resolution pyramid approaches have been proposed to speed up the registration process and increase the accuracy of th...
متن کاملDetecting Dependencies in Sparse, Multivariate Databases Using Probabilistic Programming and Non-parametric Bayes
Datasets with hundreds of variables and many missing values are commonplace. In this setting, it is both statistically and computationally challenging to detect true predictive relationships between variables and also to suppress false positives. This paper proposes an approach that combines probabilistic programming, information theory, and non-parametric Bayes. It shows how to use Bayesian no...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998